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ABSTRACT 



We report on results from the first solar Fitting at Low-Angular degree Group (solar 
' FLAG) hare-and-hounds exercise. The group is concerned with the development of meth- 

5^ I ods for extracting the parameters of low-/ solar p mode data ('peak bagging'), collected by 

Sun-as-a-star observations. Accurate and precise estimation of the fundamental parameters of 
the p modes is a vital pre-requisite of all subsequent studies. Nine members of the FLAG (the 
'hounds') fitted an artificial 3456-d dataset. The dataset was made by the 'hare' (WJC) to sim- 
ulate full-disc Doppler velocity observations of the Sun. The rotational frequency splittings 
of the I - \,2 and 3 modes were the first parameter estimates chosen for scrutiny. Significant 
differences were uncovered at Z = 2 and 3 between the fitted splittings of the hounds. Evi- 
dence is presented that suggests this unwanted bias had its origins in several effects. The most 
important came from the different way in which the hounds modeled the visibility ratio of 
the different rotationally split components. Our results suggest that accurate modelling of the 
ratios is vital to avoid the introduction of significant bias in the estimated splittings. This is of 
importance not only for studies of the Sun, but also of the solar analogues that wiU targets for 
asteroseismic campaigns. 
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X George Isaak passed away on June 5th, 2005, prior to the completion of 
this work. He is greatly missed by us all. 



1 INTRODUCTION 

The solar Fitting at Low- Angular degree Group - solar FLAG - is 
an international collaboration of helioseismologists. Its basic aims 
are to actively develop new, and refine existing, techniques for anal- 
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ysis of low-angular degree (low 1) solar p mode data collected by 
unresolved, Sun-as-a-star observations. Observations of this type 
take a weighted average, over the visible disc, of either Doppler ve- 
locity or intensity perturbations. This means they are very sensitive 
to perturbations from the low-/ modes. 

There are currently four active observational Sun-as-a-star 
programs: the ground-based Birmingham Solar-Oscillations Net- 
work (BiSON) 30-yr database, including the Mark I instrument 
on Izafia] ; and the Global Oscillations at Low-Frequency (GOLF) 
and VIRGO/SPM instruments on board the ESA/NASA SOHO 
spacecraft [both 10-yr database]. A full 11 -yr activity cycle of 
observations is also available to the helioseismology community 
from the recent officially completed ground-based IRIS program 
(although two of the sites continue to collect data). 

The unprecedented levels of S/N and resolution achieved in 
these long datasets are demanding that a fresh look be taken at 
how the basic parameters of the modes (e.g., frequency, frequency 
splitting, power, damping etc.) are extracted. When conflicting re- 
sults are given from analyses made by different groups on different 
datasets, it is hard to know from where the disagreement originates. 
It may have a contribution from differences in the analysis codes, 
or the slightly different way in which each instrument measures 
the signatures of the p modes. A highly coordinated approach, like 
FLAG, in which different codes are tested on common datasets, 
offers a constructive way forward. Similar approaches are extant 
elsewhere in helioseismology — for example in the domain of local- 
area seismology (The Local Helioseismology Comparison group, 
LoHCo'). 

Helioseismic parameter extraction is usually accomplished by 
fitting the resonant structure of the modes, in a power spectrum 
of the observations, to complicated multi-parameter models, which 
seek to represent, as accurately as possible, the underlying struc- 
ture present. Analysis of this type is usually grouped under the 
catch-all heading of 'peak bagging'. Refinement of the Sun-as-a- 
star peak-bagging is the main concern of FLAG. Those improve- 
ments to technique and application that come from FLAG will of 
course also have relevance for analysis of asteroseismic time se- 
ries of other Sun-like stars (Bedding & Kjeldsen 2003). We envis- 
age including as part of FLAG'S activities investigations targeted 
specifically to asteroseismic problems. Put another way, the group 
has (implicitly) helio-VLAG and a.s/ero-FLAG components to it. 

Activity in FLAG has centered on a hare-and-hounds exer- 
cise. Artificial Sun-as-a-star data, spanning 3456 simulated days - 
a length roughly commensurate to one 11 -year cycle of solar ac- 
tivity - were generated by WJC (the 'hare'), with a S/N per unit 
time characteristic of that in the GOLF and BiSON Doppler veloc- 
ity data (Chaplin et al. 2003a). These FLAG data were then fitted 
by nine group members (the 'hounds'). A-priori information, of the 
sort available for real observations, was passed to the hounds, but 
nothing more. The hounds were then asked to return to the hare, 
for subsequent collation and comparison, estimates of the underly- 
ing parameters of the low-degree modes. Estimates of the rotational 
frequency splittings were the first parameters selected for scrutiny, 
and are the subject of this paper. It is worth adding that a careful 
eye was kept on the other parameters during analysis to check for 
possible complications from cross-talk. 

The layout of the rest of the paper is as follows. In Section|5| 
we deal briefly with how FLAG data were constructed. In Sec- 
tion |3| we summarize commonalities in, and differences between. 
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the strategies for parameter fitting adopted by the hounds. Results 
are discussed at length in Sections |4| and |5| We pay particular at- 
tention to the effects on the fitted splittings of the modeling of mul- 
tiplets containing at least three observable components; the ranges 
in frequency over which the low-/ pairs were fitted; and a-priori 
choices made regarding modeling of peak width. Section |6| sum- 
marizes the results, noting implications for future analyses of he- 
lioseismic and asteroseismic solar-analogue data. 



2 FLAG DATASET 

FLAG datasets were constructed component by component in the 
time domain. Datasets were made to contain simulated low-/ modes 
in the ranges < / < 5 and 1000 < v < 5500 yuHz. A database 
of mode frequency, power and line width estimates, obtained from 
analyses of GOLF and BiSON data, was used to guide the choice 
of input values for time-series construction. 

The Laplace transform solution of the equation of a forced, 
damped harmonic oscillator was used to generate each component 
at a 40-s cadence in the time domain, in the manner described by 
Chaplin et al. (1997). Components were re-excited independently 
at each sample with small 'kicks' drawn from a Gaussian distribu- 
tion. The model gave rise to peaks in the frequency power spectrum 
whose underlying shapes were Lorentzian. It should be pointed out 
that the observed low-/ peaks are slightly asymmetric in shape (al- 
beit at the level of only a few per cent at most). 

The hypothetical FLAG instrument was assumed to make its 
observations from a location in, or close to, the ecliptic plane. This 
is the perspective from which BiSON (ground-based network) and 
GOLF (on the SOHO spacecraft at LI) view the Sun. The rotation 
axis of the star is then always nearly perpendicular to the line-of- 
sight direction, and only a subset of the 2/ + I components of the 
non-radial modes are clearly visible: those having even l + m, where 
m is the azimuthal order. These components were represented ex- 
plicitly in the complete FLAG time series. The visibility for given 
m and / also depends, though to a lesser extent, on the spatial fil- 
ter of the instrument (e.g., Christensen-Dalsgaard 1989; see also 
brief discussion in Section|3|below). Here, we adopted BiSON-like 
visibility ratios (see Chaplin et al. 2001). 

The predominant contribution to the frequency splitting of the 
p modes arises from the internal rotation of the Sun. This effect can 
be parameterized as an odd function of the azimuthal order, m, and 
therefore acts to lift the frequency degeneracy in / to give a sym- 
metric pattern in which the synodic separation between adjacent 
azimuthal orders (i.e., for |Am| = 1) is 0.4/iHz. Other agents, such 
as magnetic fields, that are not sensitive to the sign of m (i.e., cannot 
distinguish east from west) can also contribute to the splitting (e.g., 
Goossens 1972), but they do so in such a way as to introduce asym- 
metries in the arrangement of the components within each multiplet 
(even functions of m) 

It is at times when the level of activity on the Sun is low that 
the splittings of the low-/ modes are very nearly symmetric. At 
moderate to high levels of activity, the peaks are instead distributed 
unevenly in frequency. This is because the solar-cycle changes to 
the frequencies depend on both / and m, due to the spatially non- 
homogeneous, and time-dependent, nature of the driver for the 
changes: the near-surface activity. For the Sun-as-a-star data, asym- 
metry only becomes apparent in the frequency spacings for / > 2, 
when modes have more then two visible peaks. The asymmetry 
varies as the solar activity cycle waxes and wanes. 

In the first FLAG time series, we imposed a symmetric fre- 
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quency splitting of 0.4 yuHz on the non-radial modes (i.e., a spacing 
of O.SyuHz between adjacent visible components in this Sun-as-a- 
star dataset). The splitting value was the same for all modes, irre- 
spective of overtone number and angular degree, and was chosen 
to match the observed average (synodic) value extracted from real 
data (e.g., see Garcia et al. 2004). 

The second FLAG dataset was made to simulate the effects 
of the activity cycle. We introduced systematic changes in mode 
frequency having an appropriate underlying (/, m) and frequency 
dependence. The frequency spacings of the non-radial modes, as 
seen in the power spectrum of the full time series, were therefore 
not symmetric. However, the resulting asymmetry in the frequency 
spacings, v;j,„|=/ - v; |,„|=/_2, was smaller than the simulated contribu- 
tion of the rotation, with an average value of about Q.IjjUz (e.g., 
Chaplin et al. 2003b). 

The frequency changes were introduced into the modes in the 
second dataset via appropriately scaled variation of the oscillator 
characteristics of each component (sec. for example, Chaplin et al. 
2004a). A spherical harmonic decomposition of Kitt Peak Magnetic 
Index (KPMI) data, taken over the 3456-d period beginning 1992 
January, was used to calibrate the relative sizes of the changes for 
each (/, m). This was done from full-disc averages, as a function of 
time, of the harmonic components of the KPMI. Solar-cycle-like 
changes were also introduced in the second set in the component 
damping constants. These affect not only line width but also the 
power in the modes. Observational evidence for {I, m)-dependent 
changes to the widths of the low-/ modes is currently marginal 
at best (e.g., Chaplin et al. 2004b). As such, activity-dependent 
changes in the FLAG dataset were the same for all modes (regard- 
less of frequency, /, or m). 

To complete both datasets, a background noise component 
was also added in the time domain, the power of which increased 
at lower frequencies (a second-order polynomial in 1/v) in order 
to give signal-to-background ratios commensurate with GOLF or 
BiSON-like Doppler velocity data. Two artificial datasets were con- 
structed by the hare, each of length 3456 d. In one, values of the 
underlying mode parameters were held constant in time, while in 
the other some solar-cycle-Uke effects were introduced. 

The FLAG time series used for this paper are freely available 
at http : //bison . ph . bham . ac . uk/ " w j c/Resear ch/dataFLAG . html . 



3 FITTING STRATEGIES OF THE HOUNDS 

The time series were made available, with 100-per-cent duty cy- 
cles, for the nine hounds to fit (TA, FB, STF, RAG, SJJ-R, ML, 
DS, TT and RW). A priori information was limited to: the cadence 
and length of the datasets; and the calibration and format of the 
FLAG dataset. For the purposes of this study we chose not to im- 
pose an observational window function on the data (e.g., that from 
a ground-based network). This allowed us to test parameter extrac- 
tion under the more 'benign' conditions afforded by a continuous 
set of observations. 

All the hounds fitted multi-parameter models to the resonant 
structure in a power spectrum of the complete time series. Differ- 
ences in the adopted fitting strategies, and approaches to coding the 
algorithms, had the potential to give variation in the fitting param- 
eters. Numerical inconsistencies from the second source are subtle 
and hard to pin down. Those from the first are in principle easier 
to identify since elements of each fitting strategy can be listed and 
compared. 

A dataset of Sun-as-a-star observations gives a single time se- 



ries whose frequency spectrum will contain many closely spaced 
resonant peaks. Parameter estimation must contend with the fact 
that the various m in the multiplets lie in very close frequency prox- 
imity to one another. Suitable models, which seek to describe the 
characteristics of the m present, must therefore be fitted to the com- 
ponents simultaneously. Furthermore, overlap between modes ad- 
jacent in frequency is a cause for concern over much of the low-/ 
spectrum. Each hound therefore took the approach most often used 
for Sun-as-a-star data, and fitted modes in pairs (1 = 2 with neigh- 
bouring / = 0; and / = 3 with / = 1). All but one did so by isolating 
narrow fitting 'windows' centred on the target pairs. Chosen win- 
dow sizes varied from 40 to 50 //Hz for the even-/ pairs, and 40 to 
60 yuHz for the odd. 

The multi-parameter models to which the modes are fitted then 
usually have in them contributions from the components of the tar- 
get pair, and an additional, flat offset to represent the pseudo-white, 
non-resonant background (which varies only very slowly with fre- 
quency in the range of interest). The models, however, then do not 
take into account that small fraction of power in the vicinity of the 
target pair that comes from nearby, weak / = 4 and 5 peaks, and also 
the slowly-decaying tails of the other even and odd-pair modes in 
the spectrum. These extra contributions were allowed for by one 
of the hounds, who fitted two pairs at a time. Some of the hounds 
then went to a mode-by-mode approach at very low frequencies 
(typically below about ISOOjuHz), where the narrower peak widths 
reduce substantially the overlap from peaks of adjacent modes (an 
eff'ect also called 'mode blending'). 

While the majority varied parameters in their fitting models 
simultaneously (until the fits converged), two hounds adopted a 
multi-step iterative procedure (e.g., see Gelly et al. 2002). This in- 
volved holding a few parameters constant during some iterations, 
the intention being to reduce fitting cross-talk between strongly cor- 
related parameters. 

Further differences were present in the detail of the numbers, 
and types, of parameters the hounds sought to estimate by fitting. 
For example, some attempted to fit for peak asymmetry, where in- 
dividual components were in all cases represented by the asym- 
metric formula of Nigam & Kosovichev (1998); while others, af- 
ter preliminary analysis had indicated none was present, fixed the 
asymmetry at zero and submitted revised estimates from a (simpler) 
Lorentzian-based fitting model. There was also a division between 
those hounds who constrained the widths of all components of the 
/ = 2/0 or 3/1 pairs to be the same (again, common practice at low 
/); and those who instead fitted two widths, one for each ensemble 
of components in a mode. However, of all the potential sources of 
discrepancy, the modeling of peak visibility in multiplets was foimd 
to be potentially the most significant. 

When / > 1 a mixture of \m\ is observed. Since the visibility 
filter of the observations is strongly \m\ dependent, a suitable strat- 
egy must be adopted to allow for this. If H{1, m, n) is the height of 
any given component as it appears in the power spectrum (i.e., the 
maximum power spectral density), the height ratio e(/, n) for / = 2 
and 3 multiplets is defined to be: 

e(2, n) = H(2, 0, n)/H(2, |2|, n), (1) 

e(3,n) = H(3, |1U)/H(3, |3|,n) (2) 

respectively. The underlying ratios used to construct the FLAG data 
were e(2, n) = 0.54 for all / = 2 modes, and e(3, n) = 0.38 for all 
/ = 3 modes. The values were chosen by WJC to match those found 
in the BiSON data. 

It is common practice to include the height ratio as some fixed 
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parameter in the fitting. However, if tiie assumed value does not 
match that of the actual data, the fitted frequency splitting will be 
affected. That this is a potentially serious problem for Sun-as-a-star 
analysis was first discussed at length by Chaplin et al. (2001). They 
showed that when the adopted value of e{l,n) is larger than the 
true, underlying value the splittings will tend to be overestimated; 
while the splittings will tend to be underestimated when e(/, n) is 
too small. The bias that is introduced is more severe at high fre- 
quency. 

The hounds were given no a-priori information regarding the 
underlying height ratios in the FLAG data. As noted in Section|2| 
in real observations the exact ratios are determined by the spatial 
filter response of the instrument. The filter will be different for in- 
tensity and Doppler velocity observations; what is more, it will also 
differ from Doppler instrument to Doppler instrument, due to, for 
example, differences in the way the target Fraunhofer line is sam- 
pled to determine the Doppler signal, and the width of Fraunhofer 
line itself (Doppler imaging). With information on the nature of 
the observation, it is then possible to calculate a theoretical set of 
height ratios (e.g., see examples in Christensen-Dalsgaard 1989; 
Appourchaux et al. 2000a). It is worth adding that making the de- 
termination as realistic as possible is non-trivial, and as such is not 
without its problems. 

Although the hounds could have been given filter response 
information for the hypothetical FLAG instrument, we chose in- 
stead to explore the importance for the final results of letting the 
hounds make their own decisions. All adopted the strategy of fit- 
ting with fixed height ratios of their own choosing. These varied 
from e(2, n) = 0.50 to 0.60 at Z = 2, and from e(3, n) = 0.19 to 0.60 
at / = 3. 

Finally, all hounds restricted fits for the splitting to frequen- 
cies below about 4000 yuHz. This was because of the known prob- 
lems of estimating the parameter at high frequencies, where there 
is substantial overlap between adjacent peaks (e.g., Chaplin et al. 
1998, 2001; Appourchaux et al. 1998; 2000b). At 4000 yuHz the ra- 
tio of the separation in frequency of adjacent m to the peak width 
is ^ 0.08. The splittings are as a result both strongly overestimated, 
and poorly constrained. (See Garcia et al. 2004, and Chaplin et al. 
2004c, for discussion of the contribution of high-frequency split- 
tings to rotation inversions.) 



4 RESULTS 

The curves in the left-hand panels of Fig. Q show the rotational 
splitting estimates of each of the hounds, extracted from the FLAG 
dataset with no temporal parameter variations. Since the prominent 
outer components, with / = \m\, dominate the fitting (see, for ex- 
ample, discussion in Chaplin et al. 2004a), the fitted splittings are 
close to the sectoral mode splittings: 

6v,{l,n) = , (3) 

The fitted splittings are to be compared with the input value 
(5Vj(Z, n) = 0.4/iHz present in all modes with / > 1. The right-hand 
panels show the formal uncertainties associated with the splittings. 
All hounds extracted these in the same manner, taking, for each 
fit, the square root of the corresponding diagonal element of the 
inverted Hessian fitting matrix. 

A very similar pattern of splitting results was obtained from 
fits to the second, solar-cycle-modulated dataset. Here, the fitting 
codes had to cope with an underlying arrangement of m peaks at 



/ > 2 that was slightly asymmetric in frequency. It is possible to 
extract the multiplet asymmetry from long Sun-as-a-star datasets 
(e.g., Thiery et al. 2001; Chaplin et al. 2003b, 2004a; Garcia et al. 
2004); and, by implication, the artificial FLAG datasets here. How- 
ever, the estimates are poorly constrained because the asymmetry 
is typically smaller in size than (5v,(/, n); and the inner components 
[having (/, \m\) of (2, 0) and (3, 1)] are less prominent, and harder to 
distinguish, than the outer sectoral peaks. It should come as no sur- 
prise that since the outer m dominate the fitting the returned split- 
tings are hardly alfected by choosing to fit to a model that takes 
no account of the multiplet asymmetry (the type of model fitted 
by the hounds). An added benefit is that, with fewer parameters to 
fit, the extracted splittings are also better constrained. Because of 
the similarity of the results from the two datasets, we show in the 
remainder of the paper results obtained from the first, 'stationary' 
FLAG dataset only. 

As seen in Fig.0 the hounds' results for / = 1 were quite ac- 
curate over most of the fitting range. Moreover, the different curves 
shared many features. This revealed the extent to which fits were 
influenced strongly by the common realization noise, in spite of 
differences in fitting strategies and execution. WJC made fits to 
further independent realizations of the stationary FLAG dataset, 
which confirmed the features did not have their origin in another 
effect. It is worth emphasizing that while 'features' from the real- 
ization noise are common between hounds, they are nevertheless 
unpredictable from dataset to dataset. The / = 2 and 3 curves also 
had common features resulting from the noise. 

The now familiar overestimation of the Sun-as-a-star splittings 
was present at higher frequencies in the hounds' data (e.g., Chaplin 
et al. 2001). This resulted in mode blending, and was most pro- 
nounced at / = 1 , since at this degree the sectoral components lie in 
closer proximity than at / = 2 and 3. 

Let us now turn our attention to the differences between the 
hounds. To determine the significance of these differences we sub- 
tracted the input splitting of 0.4 /iHz from all fitted estimates. Each 
splitting residual was then divided by the mean uncertainty for that 
mode, the uncertainty being an average across the hounds. The cal- 
culated difference residuals ~ in units of the typical, fitted uncer- 
tainty - are plotted in the left-hand panels of Fig.|5|(same line styles 
as previous). The various curves therefore show how the results of 
the hounds differed from one another. The rms and absolute, ex- 
treme deviations of the difference curves are plotted in the right- 
hand panels. 

Each splitting has an uncertainty arising from the intrinsic 
(stochastic) noise on the data, and it is the formal uncertainties 
from the fitting that seek to provide an estimate of this internal 
scatter. The formal uncertainties might not, however, provide an 
accurate measure of the full, external precision in each parameter. 
Indeed, differences in the results of the hounds represent an addi- 
tional source of uncertainty for the splittings, over and above that 
arising from the intrinsic noise. This extra source of error can be 
estimated from the rms curves in Fig.|2| 

The RMS values at / = 1 were typically but a small fraction 
of the fitting uncertainties, indicating the fitting uncertainties were 
a reasonably good indicator of the actual precision of the splitting 
estimates. Analysis of fits to additional realizations of the FLAG 
dataset (made by WJC) bore out the conclusion. This was man- 
ifestly not the case at / = 2 and 3. Here, the splitting curves of 
the hounds departed noticeably from one another, although the 
'shapes' did remain similar (from the common, stochastic noise). 
The corresponding rms values were comparable to, or larger in size 
than, the internal fitting uncertainties over much of the fitting range. 



solarFLAG hare and hounds 5 




1=1 



0,60 

0.55 

_ 0.50 

^ 0.45 
c 

5 0.40 
" 0.35 
0.30 
0.25 



1500 2000 2500 3000 3500 4000 
Frequency {/.iHz) 




1500 2000 2500 3000 3500 4000 
Frequency {/^Hz) 

«=3 




1500 2000 2500 3000 3500 4000 
Frequency {/.iHz) 



.S 0.100 - 



0.001 



1.000 ^ 




1500 2000 2500 3000 3500 
Frequency {,iiHz) 

1=2 



3- 



.S 0.100 r- 



0.010 ^ \ 




1500 2000 2500 3000 3500 
Frequency {,i^Hz) 

1=3 



4000 



.S 0.100 - 



0.001 




1500 2000 2500 3000 3500 
Frequency {,itHz) 



Figure 1. Fitting results of tlie liounds (various curves). In tlie left-hand panels: extracted splitting estimates; and in the right-hand panels: the associated, 
formal uncertainties. 



At / = 3, extreme differences for some modes exceeded the 6a" 
level. 

The sharp, low-frequency increase in the Z = 1 and 3 rms 
curves merits a brief comment. The modes in the simulated datasets 
were extremely long lived at these frequencies and the S/N ratios 
were very low (to match the real observations). 

Some poor-quality fits made in this regime, which fail to give 
accurate estimates of the mode parameters, will return greatly over- 
estimated linewidths and large splitting uncertainties. That said, 
any one of these fits will tend not to give an out-of-line datum 
on the normalised difference plot, owing to the large fitting er- 
ror. Those fits that 'latch' on instead to prominent spikes from the 
broadband background will, since the associated uncertainties will 



be very small. The low-frequency parts of the / = 1 and 3 curves in 
Fig.|2|show the presence of occasional poor fits that are of this sec- 
ond type, having the expected small splitting uncertainties (because 
the fitted noise spikes are sharp). This suggests that estimates of 
the frequencies and splittings of very low-frequency modes should 
therefore not always be regarded as being accurate (which is usu- 
ally, implicitly, taken to be the case). 

5 ON THE DIFFERENCES BETWEEN HOUNDS 

To understand the differences uncovered in Figs.0and|5|we tested, 
in turn, the impact of each of the main strategy choices adopted by 
the hounds. 
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Figure 2. Left-hand panels: Fitted minus input splittings for each of the hounds, divided by mean fitted uncertainty. Right-hand panels: rms (solid curve) and 
extreme, absolute deviations (dotted curve) of the residual difference curves in the left-hand panels. 



The first issue we considered involved a simple a-priori 
choice: should one constrain all components in a mode pair to have 
the same fitted width? The second and third strategy options in- 
volved instead a free choice of real- valued constraints in the fitting 
models. These were: first, to fix the w-component height ratios; and 
second, to fix the size of the fitting window. The potential for a con- 
tinuum of unwanted bias was therefore present, with the size of bias 
dependent on the values of the imposed model parameters. 

One issue we did not address in detail was the impact of peak 
asymmetry, and attempts to fit for that asymmetry. The mechanism 
for generation (in the time domain) of the FLAG data gave peaks in 
the frequency domain that were Lorentzian. However, observations 
of the low-/ modes show the presence of small amounts of asym- 



metry. Our work, in particular fitting the FLAG data to models with 
and without asymmetry, raised some interesting questions regard- 
ing the impact of the asymmetry parameter. Artificial time series 
data, which include resonant asymmetry, are needed to properly ad- 
dress this issue, i.e., data for which the asymmetry characteristics 
may be modified on a datum-by-datum basis. This is the subject of 
ongoing work, and will be left to a future paper. 

It is possible to recover fairly accurate estimates of the in- 
put splittings from serendipitous combinations of the fitting con- 
straints, i.e., the effects of bias from the three targeted strategy 
choices may just happen to cancel fortuitously. The optimal, and 
desirable, combination is clearly one in which bias from all choices 
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is minimized. This cross-talk should be borne in mind in what fol- 
lows. 

Fig. |3| gives a concise snapshot of effects contributing to the 
disagreement at Z = 2 and 3, where the significance of the differ- 
ences is a potential cause for concern. To combine these on a single 
plot, we show splitting differences, in units of the typical fitting un- 
certainty, given by several changes of fitting strategy. Rather than 
use data from all hounds, we chose to use results from a single 
peak-bagging code (the pair-by-pair code of WJC), with the vari- 
ous fitting options activated. It is worth adding that almost identical 
results were given by the pair-by-pair codes of SJJ-R and TT. 

5.1 Modeling peak width 

The dashed lines in each panel of Fig. show the (normalized) 
shift in splitting given by changing from fitting a single width for 
all components in a pair, to two widths, one for each mode (in the 
sense two-parameter minus single-width-parameter results). The 
sets of parameters needed to make these curves were returned from 
fits where the correct height ratios, e{l, n), were adopted, and 50- 
yuHz-wide fitting windows were applied. Evidently, different com- 
binations of imposed constraints, and output results, were possible. 
However, the plotted data give a reasonable guide to the typical 
changes given by changes to the width strategy 

The width strategy choice is a familiar one for those fitting 
Sun-as-a-star data. Fig. |3| suggests the choice has an insignificant 
impact on the 1 = 3 splittings. At / = 2, the etfect is quite modest 
over most of the frequency range, and only exceeds 1 cr at higher 
frequencies, where freeing the widths of each pair makes the fitted 
splittings rise. Here, the shift is not due to inaccurate modeling of 
the underlying widths: damping coefficients in the stationary FLAG 
dataset were only frequency, and not (/, m), dependent. Changes in 
width over narrow ranges in frequency were very small; a single 
width was therefore the better fitting option for each FLAG pair. 
Indeed, the width-strategy results suggest fitting with constrained 
widths may be a good choice for real data. However, our tests took 
no account of the possible impact of dependence of the widths on 
/ and m, or of a differing solar-cycle signature. [Variation of the 
shifts, with I and m, is observed in data of medium I (Komm, Howe 
& Hill 2002), for which it is possible to achieve much higher pre- 
cision than at low /, where the evidence for degree dependence is 
only tentative (Chaplin et al. 2003c).] These are obvious avenues 
for future FLAG work. 



5.2 Modeling of m-component heiglit ratios 

Each of the hounds set the height ratios, n), at fixed values prior 
to fitting. The solid curves in Fig. |3| show the changes in splitting 
given by going from a low to high fixed ratio (in the sense high 
minus low data). To make the curves, we took e{l, n) at the extreme 
ends of the ranges used by the hounds. This meant values of 0.5 and 
0.6 at Z = 2, and 0.2 and 0.6 at Z = 3. Clearly, a higher value of the 
ratio lead to a larger fitted splitting. The point is made more fully, 
and the ranges of potential bias are revealed, in Fig.|4| It shows re- 
sults of refitting the FLAG spectrum (again using the code of WJC) 
with the height ratios at values ranging from 0. 1 (splitting estimates 
indicated by lower, solid line in each panel) to 0.7 (uppermost, bro- 
ken line in each panel), in uniform steps of 0. 1 . Similar results were 
obtained from an independent analysis by RAG. 

The range of selected ratios, and the resulting potential for bias 
in the fitted splittings, was evidently larger at Z = 3 than at Z = 2 (as 



evidenced by the solid curves in Fig.|3}. Indeed, simple inspection 
of the hounds' data in Fig. Q shows that some of the Z = 3 fits 
had been strongly affected by the chosen 6(3, n) (for example the 
uppermost two curves and the bottom curve in the lower left-hand 
panel of Fig.0. 

The correlation between the fitted splittings of the nine hounds 
and the fitting height ratios they chose is shown in Fig.|5| At Z = 2 
the correlation is seen to be rather marginal. In the light of the re- 
sults in Fig.|21this was only to be expected: changes to the splitting 
from the choice of e(2, n), while probably the largest source of bias 
in play, are not expected to be significantly larger than the shifts 
arising from the other changes of strategy. However, the expecta- 
tion at Z = 3 is that height-ratio selection will dominate over the 
other choices. The higher values of correlation bear this out. 

To formally verify the impact on the fitting, the hounds were 
asked to refit the FLAG data with the input height ratios in their 
models. Fig.|5|shows the differences in splitting that resulted for a 
sample of three of the hounds, in the sense: results given with as- 
sumed ratios minus those given with the actual input ratios, with 
the originally chosen ratios indicated in the plot titles. While the 
Z = 2 splittings of all hounds were affected by the re-analysis the 
changes were, as expected, not noticeably significant. The lower 
panel of Fig. |6|was the most extreme case. This resulted in there 
being only a modest impact on the Z = 2 rms residual difference 
curve in Fig.|2|(middle right-hand panel). The original choices for 
e(3,;i) had been rather more varied, and re-fitting brought signif- 
icant changes. Indeed, the scatter between hounds was all but re- 
moved at Z = 3 (as shown in Fig.0 confirming this effect as the 
dominant source of disagreement. 

It is possible to fit for the height ratio. However, high intrinsic 
resolution is needed to extract usable estimates of the parameter. 
In the solar case, this demands time series of length approximately 
one year or more. Even then, robust estimates can only be extracted 
from a certain range in frequency. Furthermore, other parameters 
may then also be less well constrained. 

The lower-frequency modes are in principle best suited to the 
task, since their narrow peaks allow for a clear separation of the 
constituent m. However, S/N in the power spectral density of the 
modes falls off rapidly at low frequency. The less prominent, in- 
ner components may as a consequence be too weak to extract an 
estimate of the ratio if modes at too low a frequency are selected 
for scrutiny. At frequencies above about ^ 3100;uHz the increased 
severity of mode damping compromises the reliable estimation of 
the underlying e(Z, «), since the wide peaks overlap. 

Chaplin et al. (2001) found that in the solar case a suit- 
able range to attempt the extraction was 1800 ^ v < 3000 yuHz over 
which one of the pertinent parameters here - the ratio of the sep- 
aration in frequency of adjacent m to the peak width - varies 
from ^ 3.5 to ^ 0.9. An average of the fitted values, over the 
several orders in n covered by the selected frequency range, will 
then give reasonably well constrained estimates of the underly- 
ing ratios. For example, WJC obtained average estimates, (6(Z)), 
of <e(2)) = 0.57 ± 0.04 and <e(3)) = 0.41 ± 0.02 from fits to the 
FLAG data. Both values lie within one sigma of the input values. 

5.3 Clioice of fitting window 

The dotted lines in each panel of Fig.^show representative exam- 
ples of the impact on the splittings of changes to the fitting window 
size. Here, the plotted shifts are between fits with 50 and 40-yuHz- 
wide windows (in the sense wide-window minus narrow-window 
data). The effect of the window size on the pair-by-pair approach 
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Figure 3. Impact of several effects on the splittings extracted at / = 2 and 3. Each line shows splitting differences, in units of the typical uncertainty, between 
two strategies. Differences are between fitting with: high and low values of e(/, n), with values of 0.6 and 0.5 at / = 2, and 0.6 and 0.2 at / = 3 [solid line]; 50 
and 40-yuHz-wide windows [dotted]; and two and one width parameters for a pair [dashed]. (See text for more details.) 
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Figure 4. Impact of fitting with different fixed ;)i-component height ratios. The code of WJC was used to generate the results. Values adopted for e(/) ranged 
from from 0.1 (giving data described by lower solid curve in each panel) up to 0.7 (uppermost, solid curve in each panel) in uniform steps of 0.1 (the 
intermediate curves). The input values were 0.55 at / = 2 and 0.38 at / = 3. Shown in the left-hand panels: the extracted splittings. Shown in the right-hand 
panels: fitted minus input splittings, divided by corresponding mean fitted uncertainties. 



turns out to be most severe if, as is usually the case, leakage from 
neighbouring / = 4 and 5 modes is not allowed for explicitly in the 
fitting model. 

Analyses of Sun-as-a-star data taken in Doppler velocity con- 
centrate, quite rightly, on the most prominent modes present in such 
data — those with / < 3. However, with the advent of longer, higher- 
quality Doppler velocity datasets it is now possible to extract rea- 
sonably well-constrained estimates of some parameters of / = 4 



modes (e.g., Chaplin et al. 1996; Lazrek et al. 1997), and even some 
of / = 5 (Lazrek et al. 1997) [the latter being most conspicuous in 
the near-continuous GOLF dataset]. These modes are nevertheless 
very weak in comparison to their lower-Z counterparts. In BiSON 
data, the maximimi power spectral density of the most prominent 
/ = 4 and 5 modes is typically ^ 40 and ^ 300-times lower, re- 
spectively, than in nearby / = modes. To ensure robustness in 
fits attempted on the sextupole and decupole modes, these higher / 
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Figure 5. Pearson correlation coefficient between the fitted splittings and the adopted n) of the hounds. 
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Figure 6. Impact on extracted splittings of three of the hounds of re-fitting with the correct height ratios, e{l, n) (0.55 at / = 2 and 0.38 at / = 3). Plotted are 
differences between: fitted splittings given with originally selected height ratios (in plot titles) and splittings given with the correct ratios. The 1=1 splittings 
are affected slightly by cross-talk in the fits. 



are usually given a simplified representation in the fitting models, 
for example by including only the more prominent \m\ = 4 and 5 
components. The point here is that, although these modes are weak, 
their presence may bias the results of any fits that fail to account for 
the contribution they make to the power in the fitting window. 

The FLAG dataset includes 1 = 4 and 5 modes, and, indeed, 
one of the hounds allowed for their presence during fitting. The 
other hounds adopted the more usual approach, and did not. The 
potential influence of this on the fitting can be understood from 
the placement in frequency of the 1 = 4 and 5 modes with respect 
to the even (/ = 2/0) and odd (/ = 3/1) pairs. This positioning 
is shown in the echelle-like plots in Fig.|S| The solid lines in the 
left-hand panel bound regions in frequency where the / = (upper 



triplet of lines) and 1 = 2 modes (lower triplet) are most prominent. 
The boundaries actually mark locations that are one linewidth of a 
mode (at that frequency) beyond the location of the outer m = ±1 
peaks. The dotted and dashed lines do the same, respectively, for 
the / = 4 and 5. The right-hand panel is instead centred on the / = 1 
(upper triplet) and 1 = 3 modes (lower triplet). Inspection of the 
plots indicates that the potential for leakage from higher / is clearly 
a bigger cause for concern for the even-degree pairs, because of the 
closer proximity of the 1 = 4 and 5 peaks. 

Fig.|9|reveals the impact on the extracted splittings of chang- 
ing the fitting window size. TT performed fits with window sizes 
that ranged, in uniform increments of 5 yuHz, from 35 to 65 yuHz. 
These results bear out the simple prediction from Fig.[8| The 1 = 2 
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Figure 7. Overall impact at / = 3 of re-fitting with the correct height ratios. Shown in the left-hand panel: the re-fitted splittings of the nine hounds; and in the 
right-hand panel: the rms scatter, in units of the typical formal uncertainty, between hounds. These data are to be compared with those of the original fits in 
Fig.0(bottom left-hand panel) and Fig.|5|(bottom right-hand panel). 
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Figure 8. Location, in frequency, of / = 4 and 5 modes with respect to their more prominent / = 2/0 and 3/1 counterparts. The solid lines bound regions 
in frequency where the / = and 2 modes (left-hand panel) and / = 1 and 3 modes (right-hand panel) are most prominent. Boundaries mark locations one 
linewidth of a mode (at that frequency) beyond the location of the outer m = ±1 peaks. The dotted and dashed lines do the same, respectively, for each of / = 4 
and 5. 



splittings are noticeably affected in the range above ^ 2500 /jHz, 
where the increased power in the neighbouring / = 4 and 5 peaks 
means the leakage is most severe. In contrast, the odd-pair 1=1 and 
3 splittings show little change. The relative importance for each de- 
gree is similarly reflected by the significance of the dotted curves 
in Fig.|3l 

The analysis above, and range of fitting windows actually cho- 
sen by the hounds, suggested that while the window-size effect 
would have contributed at / = 2 to the splitting differences, it was 
probably not the largest acting; the biggest effect probably came 
from the height-ratio selection. 



6 DISCUSSION 

Nine members of the FLAG group (the hounds) analyzed an artifi- 
cial 3456-d dataset constructed by the hare (WJC). The data were 
made to mimic seismic, Sun-as-a-star observations in Doppler ve- 
locity. Fits were made by the hounds to the low-Z resonant peaks 
in a power spectrum of the full time series, under the favourable 
conditions offered by a 100-per-cent duty cycle of simulated obser- 



vations. Estimates of the multiplet frequency splittings of the 1=1, 
2 and 3 modes were returned to the hare for further scrutiny. The 
nature of the data are such that these splittings are close to the sec- 
toral splittings of the modes. 

All nine sets of splittings were affected at high frequencies 
by the well-known effects of mode blending. This gave rise to an 
overestimation in the splittings, which was most severe at / = 1 . It is 
at this degree that the outer components are most closely spaced in 
frequency, and the associated splitting uncertainties are as a result 
also largest for the dipole modes. 

Comparison of the splittings revealed differences between the 
hounds, and several sources of bias were identified and studied. 
This bias turned out to be of least significance at / = 1, where the 
RMS scatter between the splittings of the hounds was but a small 
fraction of the typical fitting uncertainties. Departure of the fitted 
splittings from the input values was dominated at / = I by real- 
ization noise, and a variety of fitting strategies and imposed fitting 
constraints gave similar results. 

Scatter between hounds was in contrast more severe at / = 
2 and 3, where the lower uncertainties meant bias from several 
sources came into play. The simpler case to explain was / = 3. 
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Figure 9. Impact of imposing different sizes of fitting window. The code of TT was used to generate the results. Window sizes ranged from 35 (giving data 
described by lower solid curve in each panel) up to 65 yuHz (uppermost, solid curve in each panel) in uniform steps of 5 fiUz (the intermediate curves). Shown in 
the left-hand panels: the extracted spUttings. Shown in the right-hand panels: fitted minus input splittings, divided by corresponding mean fitted uncertainties. 



Here, variation in the m-component height ratios used by hounds 
in their fitting models was the dominant source of bias. This vari- 
ation gave rise in the first sets of submitted splittings to rms scat- 
ter between hounds at a level, over much of the frequency range, 
of twice the typical fitting uncertainties (and extreme difi'erences 
of more than 6cr). Re-fitting, with the correct height ratios, all but 
eliminated these differences. 

At Z = 2, differences arose at the rms level of Icr. These dif- 
ferences came from a combination of eff^ects. The largest was prob- 
ably from the aforementioned height ratio choice. Other bias came 
from the chosen size of fitting window, and from a-priori choices 



over fitting strategy (e.g., how to constrain the fitting widths in a 
pair). 

What are the implications for future analysis of Sun-as-a-star 
data? First, accurate modeling of the m-component height ratios is 
vital to avoid the introduction of potentially significant bias in the 
fitted splittings. This can be done either by accurate and precise 
modeling of the instrumental response function, or by estimation 
of the ratio by fitting. A careful strategy will largely remove any 
bias at / = 3. At / = 2, the size of the fitting window also matters. 
Sensitivity to this can be reduced by making allowance in the fitting 
model for the presence, in the fitting window, of power from nearby 
/ = 4 and 5 peaks, and 'background' from the slowly decaying tails 
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of the other even and odd-pair modes. With bias from the height 
ratio and window effects largely removed, the choice of how to 
model widths in a pair contributes only at liigher frequencies. 

It is also worth commenting on implications for asteroseismic 
analysis on solar analogues. Differences in internal rates of rotation, 
convective properties, and angles of inclination offered by stars all 
present their own set of fitting problems. Faster-than-solar rotation 
can reduce bias compared to the FLAG case, since peaks are then 
more widely spaced in frequency and easier to distinguish. How- 
ever, clear advantages only accrue if the widths of mode peaks are 
not also significantly increased in size. The strength of damping, 
and the width of peaks, is largely dependent on the convection in 
the star. For example, a main sequence star which is hotter than 
the Sun will tend to support more vigorous convection in a thinner 
sub-surface zone; other things being equal, this will lead to heavier 
damping of the acoustic modes, and larger widths in the frequency 
domain. 

The angle of inclination, has a large effect on the observed 
pattern of peaks. The extreme cases, of ; ^ and =^ 90 degrees 
(the latter relevant to the extant helioseismic data), give patterns 
of peaks that are least 'cluttered' . While this can help reduce bias 
from blending of adjacent components, the fact that components 
are missing means that information on the rotation is limited. An 
intermediate angle may result in all components having reasonable 
visibility (provided the background noise is small); however, the 
spacing between observed components is then approximately half 
that in the extreme (' cases. The impact of mode blending is then 
much more severe. 

Our studies indicate that accurate estimation of the height ra- 
tios - which have a strong dependence on / - will be important 
for accurate asteroseismic inference on rotation. Gizon & Solanki 
(2003) and Ballot et al. (2004) have already looked at this prob- 
lem with artificial data. They found it was possible to extract well- 
constrained estimates of i from time series of fairly modest length 
(5 to 6 months), provided stars rotated at least twice as fast as the 
Sun, and ( > 30 degrees. An astero-FLAG investigation on short 
dataset lengths is clearly desirable to fully test the conclusions of 
this paper for realistic asteroseismic scenarios. 
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